Goto

Collaborating Authors

 aspiring data scientist


Machine Learning in Three Steps: How to Efficiently Learn It

#artificialintelligence

I have observed two extreme approaches when it comes to aspiring data scientists attempting to learn machine learning algorithms. The first approach involves learning all the intricacies of the algorithms and implementing them from scratch to gain true mastery. The second approach, on the other hand, assumes that the computer will "learn" on its own, rendering the need for the individual to learn the algorithms unnecessary. This leads some to only rely on tools such as the package lazypredict. It is realistic to take an approach between the two extremes when learning machine learning algorithms. However, the question remains, where to start? In this article, I will categorize machine learning algorithms into three categories and provide my humble opinion on what to begin with and what can be skipped. Starting out in machine learning can be overwhelming due to the multitude of available algorithms. Linear regression, support vector machines (SVM), gradient descent, gradient boosting, decision trees, LASSO, ridge, grid search, and many more are some of the algorithms that come to mind when posed with the question.


Unlock the Power of Synthetic Data: A Guide for the Aspiring Data Scientist

#artificialintelligence

Data for Machine Learning Model is the heart of the AI world. But many companies struggle to find adequate data to build a successful model. That's where Synthetic Data comes in. Synthetic Data is generated using different techniques, some of which are statistical methods, deep learning methods, open source technologies and so on. Synthetic Data has many benefits, such as being beneficial for companies lacking data, being able to generate non-dominating class data, allowing for the generation of data without using PII and aiding autonomous vehicle companies.


January Edition: Becoming Better Learners

#artificialintelligence

Daily, Weekly, Monthly, and Yearly Goal Tips to Guide a Self-Taught Data Scientist in 2023 (December 2022, 11 minutes) A good plan is key to reaching your learning goals, and Madison Hunter is here to help with a robust roadmap for building one that is both ambitious and sustainable. How to Explore Machine Learning and Natural Language Processing as a High School Student (July 2022, 12 minutes) This helpful guide by Carolyn Wang might be framed around her own experience as a high school student, but it's a helpful introduction to ML and NLP for aspiring practitioners of all ages. The Simple Things a Data Science Beginner Needs to Know (December 2022, 11 minutes) Ken Jee's recent resource is an accessible, up-to-date primer for anyone taking their first steps in data science this year. Here Are My 3 Suggestions for Newcomers (April 2022, 5 minutes) For all the independent learners out there who choose not to follow an established curriculum, Soner Yıldırım offers a few key insights based on his own experience as a self-taught data professional. A Brief Introduction to Neural Networks: A Regression Problem (December 2022, 12 minutes) How do you go about learning a complex technical topic from scratch?


Machine Learning for Aspiring Data Scientists: Zero to Hero

#artificialintelligence

This course will teach you the foundations of machine learning. The content was especially designed to help you pass machine learning interviews for data science jobs. In academic courses, your teacher spends hours speaking about calculus and linear algebra, but then none of that comes up in a job interview! That in-depth knowledge certainly has a place but is not what most companies are looking for. In bootcamps you tend to learn how to use many tools but not how they work under the hood.


Importance of Data Science and Artificial Intelligence in education sector

#artificialintelligence

Meet Aswini Thota, an Analytics and Artificial Intelligence (AI) leader who solves organisational and business problems leveraging data. He always believed in the power of data and amased what insights we can grasp from it. Over the course of his career, Aswini has developed a skill set in analysing data and he hopes to use his experience and expertise in data science to help people discover the amazing career opportunities that lie ahead in the field of Data Science. He has effectively evolved from a machine learning researcher to an award-winning AI / Data science leader. Aswini holds two master's degrees in Electrical Engineering and Data Science.


How I Tripled My Income With Data Science in 18 Months - KDnuggets

#artificialintelligence

Around 18 months ago, I lost my job due to the COVID-19 pandemic. I was working as a part-time tutor while in college. The money I got from tutoring was used to cover expenses like food, petrol, and my car. After the government imposed lockdown restrictions on the entire country, I was unable to continue teaching. I couldn't go to college either and had to study at home.


Blog: Top Math Resources for Data Scientists

#artificialintelligence

At some point, every aspiring data scientist has to get familiar with mathematics for machine learning. To be blunt, the more serious you are about data science, the more math you'll need to learn for machine learning. If you have a strong math background, this is likely to little issue. In my case, I've had to relearn much of the mathematics (note – I'm not done yet!) that I took at a university as my professional life had allowed my math skills to atrophy. Based on my experience teaching our bootcamp there is also a group of aspiring data scientists that fall into a category where their formal math training needs to be augmented.


10 Mistakes You Should Avoid as a Data Science Beginner - KDnuggets

#artificialintelligence

Data science is a success. The data science field is a very competitive market, especially to get one of the (supposed) dream jobs at one of the big tech companies. The positive news is that you have it in your hand to gain a competitive advantage for such a position by preparing yourself adequately. On the other hand, there are (too) many MOOCs, master programs, bootcamps, blogs, videos and data science academies. As a beginner, you feel lost. Which course should I attend? What topics should I learn?


Natural Language Processing With Transformers in Python

#artificialintelligence

Transformer models are the de-facto standard in modern NLP. They have proven themselves as the most expressive, powerful models for language by a large margin, beating all major language-based benchmarks time and time again. In this course, we learn all you need to know to get started with building cutting-edge performance NLP applications using transformer models like Google AI's BERT, or Facebook AI's DPR. Throughout each of these use-cases we work through a variety of examples to ensure that what, how, and why transformers are so important. Alongside these sections we also work through two full-size NLP projects, one for sentiment analysis of financial Reddit data, and another covering a fully-fledged open domain question-answering application.


In defense of statistical modeling

#artificialintelligence

Data science has been hot for many years now, attracting attention and talent. There is a persistent thread of commentary, though, that says data science's core skill of statistical modeling is overhyped and that managers and aspiring data scientists should focus on engineering instead. Vicki Boykis' 2019 blog post was the first article I remember along these lines. Don't do a degree in data science, don't do a bootcamp…It's much easier to come into a data science and tech career through the "back door", i.e. starting out as a junior developer, or in DevOps, project management, and, perhaps most relevant, as a data analyst, information manager, or similar… While tuning models, visualization, and analysis make up some component of your time as a data scientist, data science is and has always been primarily about getting clean data in a single place to be used for interpolation. More recently, Gartner's 2020 AI hype cycle report acknowledges the role of data scientists but says: Gartner foresees developers being the major force in AI.